WO 98/04586 



PCT/GB97/02046 



s t :; 



Morphological 
marker map 



RFLP map 

(Carisbcrgtl Mto x 
Granncnloso Zwcizetligc mlo-t1) 



AFLP map 

(log rid MIo x BC Ingrid mk>-3) 



Ccn -1 



5.0 CM 



< 



/28 



t.OcM 



CM 

e 

! 



33 



4.0 



to" 

I 
CD 



3: 



cn 
a. 
< 

f 



\ 



23 



0.7 



2.4 



£ 

X 

CD 

t" 



0.24 



0.1 



0.30 



0.1 cM 



-s. <X> 



L Tel 

2.3 cM 



O - 



to 
<r> 
c*> 

O 
CD 

< 



J 



0.3 0.4 cM 



e 

CL 

tn 

t 



cM 



YAC YHV303-A6 



— CM 

EH £ 

a. cl x 

cqco CD 

I ( t 



CT> 

6 
a. 
as 



rT"jj 



100 kb 



BAG F15 



cr 
8 

L. 



£ *s -c i 

■ 1 % ! 1 f 



1 '. 



v. VJ VJ 

cm r% 

a: 



t > i n 



— — (T 



Sequence contigs 



S Kb 



Mlo gene structure 




m/o-S I mto* 1 3 
mfa-G I mlo- 17 
mto-0 



mlo- I mk>* tO (&) tnlo-7 
mto-4 (6.) 



I 

mlo -26 



to 



1 1 



12 



I 

m/o-3 (&/ 



100 bo 



Figure 1 



WO 98/04586 



PCT/GB97/02046 



2/28 

MSOKKGVP A R E L P £ T P S W A V 
ATGTCGGACAAAAAAGGGGTGCCGGCGCGGGAGCTGCCGOAGACGCCGTCGTGGGCGGTG 60 

AVVFAAMVLVSV LMEHGLKK 
GCGGTGGTCTTCGCCGCCATGGTGCTCGTGTCCGTCCTC ATGG AACACGGCCTCC ACAAG 120 

LGHWFQHRHKKA LWEALEKM 
CTCGGCCATTGGTTCC AGCACCGGCACAAG AAGGCCCTGTGGG AGGCGCTGG AG AAGATG 180 

KAELMLVGFISL LLIVTQDP 
AAGGCGGAGCTCATGCTGGTGGGCTTCATATCCCTGCTCCTCATCGTCACGCAGGACCCC 240 

IIAKICISEOAAOVMWPCKR 
ATCATCGCCAAGAT ATGCATCTCCGAGG ATGCCGCCG ACGTCATGTGGCCC7GCAAGCGC 300 

GTEGRKP S K Y V D YCPEGKVA 
GGCACCGAGGGCCGCAAGCCCAGCAAGTACGTTGACTACTGCCCGGAGGGCAAGG7GGCG 3 60 

LMSTGSLHQLHVF I F V L' A V F 
CTCATGTCCACGGGC AGCTTGC ACC AGCTGC ACGTCTTC ATCTTCSTGCTCGCGGTCTTC 420 

H V T Y S V IT IALS R L K^MRTWK 
CATGTCACCTACAGCGTCATCACCATAGCTCTAAGCCGTCTCAAAATGAGAACATGGAAG 4 80 

KWETETTSLEYQFAMDPARF 
AAATGGGAGACAGAGACCACCTCCTTGGAATACC AGTTCGCAAATGATCCTGCACGGTTC 5 4 0 

RFTHQTSFVKRH LGLS STPG 
CGGTTCACGCaCCAGACGTCGTTCGTGAAGCGCCACCTGGGCCTCTCCAGCACCCCTGGC 600 

IRWVVAFFRQFFRSVTKVDY 
ATCAGATGGGTGGTGGCCTTCTTC AGGCAGTTCTTC AGGTC AGTC ACC AAGGTGG ACT AC 660 

LTLRAGFINAHLSQNSKFDF 
CTGACCTTGAGGGCAGGCTTCATCAACGCGCATTTGTCGCAAAACAGCAAGTTCGACTTC 720 

HKYIKRSMEDDF KVVV'GISI* 
CACAAGTAC ATCAAG AGGTCG ATGG AGGACGACTTCAAGGTCGTCGTCGGC ATC AGCCTC 7 80 

PLWGVAILTLFLD INGVGTL 
CCGCTGTGGGGTGTGGCG ATCCTC ACCCTCTTCCTTG AC ATCAATGGGGTTGGC ACGCTC 8 40 

IWISFIP LVILLCVGTKLEM 
ATCTGG ATTTCTTTC ATCCCTCTCGTG ATCCTCTTGTGTGTTGG AACCAAGCTGGAGATG 900 

I I MEM A LE IQDRASV I KGAP 
ATCATCATGGAGATGGCCCTGGAGATCCAGGACCGGGCGAGCGTCATCAAGGGGGCCCCC 9 60 

VVEPSWKFFWFH R P D W V L F F 
GTGGTCGAGCCCAGCAACAAGTTCTTCTGGTTCCACCGCCCCGACTGGGTCCTCTTCTTC 1020 

I H LTLFQNAFQM A H F V W T V A 
ATACACCTGACGTTGTTCCAGAACGCGTTTCAGATGGCGCATTTTGTGTGGACAGTGGCC 1080 

TPGLKKCYHTQIGLS I M K V V 
ACGCCCGGCTTGAAGAAATGCTACCACACGCAG ATCGGGCTCAGC ATC ATGAAGGTGGTG 1110 

VGLALQFLCSYMTFP L Y A L V 
GTGGGGCTAGCTCTCCAGTTCCTCTGCAGCTATATGACCTTCCCCCTCTACGCGCTCGTC 120 0 

TQMGSNMKRS IFO £ Q T SKAL 
ACACAGATGGGATCAAACATGAAGAGGTCCATCTTCGACGAGCAGACGTCCAAGGCGCTC 12 60 

TNWRNTAKEKKKVROTOMLM 
ACCAACTGGCGGAACACGGCCAAGGACAAGAAGAAAGTCCGACACACGGaCATGCTGATG 1320 

A Q M IGDATPSRG S S P M p 3 R G 
GCTCAGATCATCGGCGaCGCAACACCGAGCCGAGCCTCCTCGCCCATGCCCAGCCGGGGC 1300 
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Figure 2 (Continued) 



SS P V H L LH KGMGRS DDP'QSA 
TCATCACCCGTGCACCTGCTTCACAAGGGCATGGGGCGGTCGGACGACCCCCAG AGCGCG 1 4 4Q 

PTSPRTQQEARDMYPVVVAH 
CCCACCTCGCCAAGG ACCCAGC AGG AGGCT AGGG ACATGTACCCGGTTGTGGTGGCGC AC 1500 

PVHRLNPNDRRRSASSSALE 
CCGGTGC ACAGACT AAATCCT AACG ACAGG AGGAGGTCCGCCTCGTCGT CGGCCCTCGAA 1560 

ADIPSADFSFSQ G* 
GCCGACATCCCCAGTGCAGATTTTTCCTTCAGCCAGGGATGA 1602 
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0/28 FIGURE 5 



292 GCGGAGCrrr.ATGCTX^TaaarrT^ 341 
If -MM f I I I I I i I I 1 M I M I : J I II I I M Nil (II | f ( 
80 GCANAGCTGATGCTGCTGGGCTTCATNTCCCTGCTTCTCACCGTGGCACA 129 

342 GGACCCCATCATCf^r:AAf?ATAT(^ATr!T(^(^ffAf^ATf?r!n(^r!f?APr^r > ^ 391 

I I J I IMI Il(l:ri illlll II (I 11 I f 1 f 1 ( f 
130 GGCGCC. . .CATCTCCAANATCTGCATCCCCAAGTCGGCTGCCAACATCT 17 6 

392 TCrTfiGCcrrrf^A Af?c^r^ArmAf^crgr . AAannniar^AaT^rrzT 440 

IN III 1 i I I I I I I I : II I 1 fill : I II { || 
177 TGTTGCCGTGCAAGGCAGGCCNAGATGCCATCGAAGAANAAGCAGCAAGT 226 

441 TGACTA(rmrrrraAraTr:*r^ 490 

I 1 : I : I M I I I I Illlll I 1(11:! 
227 GGTCNCCNGTCC . TTGGCCGGCGCCGGCGGCGGGGACTACTGCTCNAAAT 275 

* • • . . 
491 TGATGAAGAAATCAATACC GAACTTTTTCTTGTTTTCT 528 

I 11111:11 III f f : : : 

27 6 TCGATGTGAGAATAACNCCAGCTGCCGGCAAGCACAACCTCGATNCNATN 325 

• • * * , 
529 TCTG ATTGTCGTCTTGGCTTGGCTTAATTGGTGTGTGTGTGTGTGTTTGC 578 

I I = M I I (Mill I I I 1 I i I I 1 i 

326 ACTNATT TAACTATAATTGATTTTTCTTGGGTTTTCTGC 364 

579 iMKiKreAAfi(7irayxyTC 

i I H I I I 1 I I I I I II 1 INN f 111 ! I I M II II 1 I If I [ 

365 AGGGCAAGGTGGCGCTGATGTCGGCAAAGAGCATGCACCAGCTGCACATT 414 

629 TTCATCTTCGTGT!Tr!f^r^TCTTr;nATf;TC Af:r!T A C A<?C(?TC ATf! A PP 678 

M I I I I M II I M I I f I If I I f If I II I I 1 I i I II f I I I I I 1 I i 1 
415 TTCATCTTCGTGCTCGCCGTGTTCCATGTTACCTACTGCATCATCACCAT 464 

579 AGCTCTA AffT!nr,TrTTR A A^TCAP^rTTT^TTrT ! TCTTCTTCTT 723 

1 I II I I 1 I 1 II I I I I I 1 I if Mil I I 1 If 

465 GGGTTTAGGGCGCCTCAAAGTGAGTTTGTCGTTCTGTCCCTCATGCACAT 514 

724 CTTTTACC GCACGTCTGTCTGTCAGGCGTACCTACCTGTTCA 765 

I I I I I III: fill III I I 1 f I I 

515 GTTTTCTCTAGTTCTAGCAANATTGTCAGTCCTyCAAATGGATTGTTTCG 564 

766 TCAGGCTTGAGTAAAACTGTTCCATAATCTGC TCCGGCATAA 807 

II M I I I N llll Ill I M I 1 
565 ACA AGAAACCCAATTTATTAATTTGCCAGTTAAATATATAATAA 608 

808 TCCTCTCCTCCTG CAGATGAGAAC.ATCttAAGAAArcttAGzr^fzzcz 853 

I - M II I I II 1 I I I I I 1 I I I I I M I I I I II If 

60 9 TTGATCTTTCTTGGTTTTAGATGAAGAAATGGAAGAAGTGGGAGTCACAG - 65 8 

854 ACCACCTrrmvyrA at arr kcyrrc.cx** a at^tp AnnArr.r.rm ar^^ 903 

_ llll I I I I I I I I If II I I I I I If I III I If 

659 ACCAACTCATTGGAGTATCAGTTCGCAATCGGTAGTG . AATTAA 701 

904 CAATCTCCC ... CTTCTTCGAAACCAAACC TGATGATCCATTTAAA 946 

^ M H 1 H I I I II I I I I llll I I I I I I MM! 
702 GAATCTCCCTAACTATTTCATTTCAGAACCTTTATGATAATGTCTTGAAA 751 

947 GACGCAGGCACGATCAGAGTGAGTGAACTGATGTATGTTCATTTTTTGTG 996 

Ml f Mill I I 1 I It If I 
7S2 GAGGAGGAGCAAATCAG . CTGAAAAAT ATGATCGA 785 

997 TCCTTTCAGATCfrrrcr-Arf^TTC^ 10 4 6 

M I I M I I I I I I If I I III II II II II 1 I M II II II II M I If 
786 TCCATGCAGATCCTTCACGATTCAGGTTCACGCATCAGACGTCGTTCGTG 83S 
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1047 AAcrarca cfrrrarv r.r ^r^rnA(^AcncrTTO(^ 1093 
fill! II 1 1 I I I I I 11 I II II M T I II I f I I I I I I I II 

836 AAGCGGCATCTGGGATCATTCTC^GCACCCCTGGGCTCAGATGGATCGT 885 

1094 GAGTTTTTT AGCTTCTT ATCTGCCCCTCATCTGTGTGT AATGTT 1137 

I I I I I I INI II III IN I 1 II 

886 GAGTTATCAATCTCCGAAT ACATGCTTGTTTTTTATTCTTGCA 928 

*<••** 
1138 . . TGGCGTA TGGAGTCAGGTGATTT ACCTT 1165 

INI M I I (I I III I I 

929 ACTGGCCTAGCTGTTCCAATTCAATCCATATTTTTTGAAAAAAAAAATAT 978 

1166 ^CTCTftATCTTTCTrrrarr^ 1215 
I II 11(111 I I I I I I I I II M I I I I N I I I I I f I I I 

97 9 TCATGCCGTGTTTG TTGTTAGGTAGCATTCTTCAGGCAGTTCTTT 1023 

1216 AC^TCAGT CArcAAGGTGftArTAr^^ 1265 

I 1 I I M [ I I I I N I I II I 1 I I ( I I I I M II 11 I (I M 1 1 I M M 

1024 GGGTCCGTCACCAAGGTGGACTACCTGACCATGCGGCAAGGCTTCATCAA 1073 

* m • • * 

1266 GGTACGTGC CTCCCCTTCTAGCTCCGCCATTGCTGCCGCGATGTAG 1311 

III I I I Ml I ( Mill f 1 111 II 

1074 TGTATAT ACTAATCAAACCTG ACCAATTCAACATTGATG ATGC . AAACAG 1122 



1312 CAGCAAAGCTTCT CAAGTTATCCTTCTGACGCTAAAGTTCCCA 1354 

I I I I I 1 I I (III I I I !IM III I 
1123 AAGACCAGGTTTTTTTTTTCCGAGTTGTGCAT . TGAAGTTAATG 1165 

1355 TGTTTTTTCCT^AAATTATTCT^f^ GnCG . CATTTCTTflT AAA ACAflT! 1403 

Mill I II II 1 I I I I I I 1 I I I 1 I I I I I I I I I 1 Ell 

1166 . GTTTTAGCTTC . . . TTCTCTTTTGCAGGCGCCATTTGTCGCAGAATAGC 1211 



1404 AACTTC^A CTTr^ArAAGTArATrAA nAr^TrffATGGAGGAraAnTTrAA 1453 

I 1 I I I I I I I I I 1 I I I I I 1 I I II I I I I I I I M I I 1 I I I II I 1 I I I 1 I 1 
1212 AAGTTCGACTTCCACAAATACATCAAGAGCTCTTTGGAGGACGACTTCAA, 1261 

1454 rOTCnTC^TCf^ATHAraTACG^ 1503 

I I I II I I I M I I I 1 I 11 I I MM I I I 

12 62 - AGTTGTCGTTGGCATCAGGTCCG TCCTCGCTTT 1294 

1504 CACCCCATGGATAGATTTTAACAATTGCTGTCAGGTTCCACATGATAACA 1553 

m Mill i ii i i- iii mi i 

1295 ATTAATTATAGGA CTCTTATATTCAACATTTTTTTT 1330 

1554 ATATACTATGA . ACTTGGTCTTTGCTCCTTGTCCTTG CACGATCA 1597 

Ml I I I I 11 MM I MM 11 fill 
1331 ATAAAG AAACATATTT AGTCT CCAGTTGTGTATGTGTATGTGGATCT 1377 

1598 TGA^CATTTG^trrftTTTTCf^ 1647 

IIIIIIIIIIII MI Ml M M M M M M If M II I Ml 
1378 TGACACATTTGG . CTCGTTTTGCAGCCTCCCTCTGTGGTTCGT CGGAATC 1426 

1648 CTCArrc TCTTrrrrTttArA^ 1697 

M 11 M M It II Ml I MM M II M II I M 

1427 CTTGTACTCTTCCTCG AT ATCC ACGGT A . . ATCCTTGTCCT ATTT 1469 

1698 CTCTATTGCMTGCAGCTAAATAAAACACTTGCAATTCGTCTCGTGATCA 1747 
I III MI Mill I M M II M I MM 

1470 CATTCTTTTTTTTACTCTCAAAACCTTGTTCTGAATTGGtCTTATAATCA 1519 

• • * • . 

1748 CCGCTCATTTTTCAAgCATTTgTTTTTCTACTCATA GGGGTTGGrArOCT 1797 

M (Mill I I M I I I 1 I MM II I II It M 

1520 CCATCGATTTTTTTTCAACTT . TT TCCCCGC GTGT AGGTCT TGGCACACT 1568 

1798 CATrT(^ATTT<rTTTr^Tr rrT ^ r ^ T< ^ TAAGTag - AGATTTCTPir AT 1845 

II I M II I I 11 1 I I M I I I 1 Mill M I Mill I 1 I . 
1569 TATTTGGATCTCTTTTGTTCCTCTCATCGTAAGAGCGAAATTTCCCCTGT 1618 
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1846 CGAAAGCAACAGCAAACCCAATT » TGATCGCAAT 1878 

I I I | | (MM 111 I 1 I I IN III 

1619 CCAftAGAAACAGTTAACATAATTAATTATGCTTTAATTTATCATGAAAAT 1668 

* 

1879 GG AAACCCACACCTAATATTAACTCAAAATGTCAATTGTCGGTGCGTCTT 1928 

It I I Ml III! I M III HI I 

1669 TAATATGATCATATAACTAATGAACAAACATTCA . . TGTG AATGCCACCG 1716 

1929 CCTCAAGAfl ATCHTT^TGTGTGTTfflS A Af!C AA Gf?TGG AG ATG ATC ATC AT 1978 

llllll MMII IMM MMMM MIMI I MM! 

1717 TTGTCTCAGATCGTCTTGTTAGTTGGGACCAAGCTAGAGATGGTGATCAT 17 66 

1979 gg ag ATGGf; r:r^rGG a g a tcc a gg accgggcg a GCGTC ATC ft AGGGGGCCC 2028 

I M If II I I M Mill M II M MM I M I I I MM M I 
17 67 GGAGATGGCCCAAGAGATACAGGACAGGGCCACTGTGATCCAGGGAGCAC 1816 

2029 GCGTGGTCGAGCr^AGC A AP A AGTTC!TT(^GGTTCCACCGCCCCG ACTGG 2078 

I MM M M MMMMM IIMMMM M1MM MMII 
1817 CTATGGTTGAACCAAGCAACAAGTACTTCTGGTTCAACCGCCCTGACTGG 1866 

2079 GTCCTHTTf^TTr! AT AC ACIftTG ACGTTGTT 2107 

Ml I M M I M II M I I M II I M 
1867 GTCTTGTTCTTCATACACCTGACACTCTTCCCATGTACATGTTTAAAACC 1916 



2108 rr AGA ACGC .GTTTCAGATGGrGrtATTTTG 2136 

M I II M M I M M M M M I I It M I 
2017 GACGGACGGATCGATCATCACCAGAACGCATTTTCAGATGGCGCATTTCG 2066 

2137 TGTGGAC?Af?TG GTACGCCAC CGATGAACTTGTCAGTT 2173 

I MM! II Ml M M I M MMMM 

2067 TATGGACTATGGTGTGTATGCTACTTGCTTAGTTGTTGCCATTATCAGTT 2116 

2174 AACATGGGTGTCA. . .AGGCACCGAGTGCCGCTGATGA 2208 

II I Mill I MM M MMII m _ 

2117 CTTAAGCAAATTAAGTGTGATGCATGCACTGA CTAATGAGACAA 2160 

2209 ACTGCTCTGACGGAGATTTACTTGTGTTGT AGGCC 2243 

Mill I Mt I MM MM MM 
2161 AAAATGACACAGCTTGTTCATCGATCTGGTTGTTTTGTGTGTGACAGGCA 2210 

2244 ACfiCrrraOTTGAAGAAA T<^ 2293 

MMII MIMIMMM Ml III IMIIMMM I 

2211 ACACCTGGTCTGAAGAAATGCTTCCATGAAAATATTTGGCTGAGCATCGT 2260 

2294 GAAr^f ^(^GGry^ A<yriv^ 2343 

I I I I I II M M I M M I II I II- M II M II II M II 
2261 GGAAGTCATTGTGGGGATCTCTCTTCAGGTGCTATGCAGCTACATCACCT 2310 

2344 Trccrarrr^ArGCf*^^ 2389 

MM M I II II M M M M M M II II I M II I II It 
2311 TCCCGCTCTACGCGCTCGTCACACAGGTGAACAAGCCATTCACAAA 2356 
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FIGURE 6 



295 GAGCTCATGf:TraTra^TTnATATPrcTCCT^ 344 

I I I M 1 : I I t I I I I 1 I | I I t M i ( M M M 1 I I 1 I I f 1 1 I I I I ! I I I M I 
1 GAGCTCNTGCTGGTGGGCTTCATATCCCTGCTCCTCATCGTCACGCAGGA 50 

345 ccccATCATrnrrAAr:ATATr^ATCTrr:f?A^ATGCcnrraArnTrATr:T 394 

If I M ill I I I MINIMI MM Mil MM MM 
51 TCC . . . CGTCTCCAGG ATCTGCATCTCC AAGGAGGCCGGC G AN AANATGC 97 

395 GGCCCTGC aag . . ccraac ArcftAGtWrfrsPAAGrrp a 430 

I I M 1 I I I : (IN ( M M I M 1 M 

98 TCCCGTGCAAGCCTTACNACGGCGCCGGCGGTGGCAAAGGCAATGACAAT 147 

431 ttAAWAaaTTGACTAC-TGCttarS 455 

MM M : I M 1 M I 
148 CACCGGAGGCTTCTCTGGCTCC AAGGCGANAGCG AN ACCCACCGCCGGTT 197 

456 GGTGAGCAGCAGAGCCCGGACCAG 479 

II if If I I! M I : ! 

198 CCTG . GCTGCCCCGGCCGGANTGG ACGTCTGCGCCAAACAGGTG AGCACC 24 6 

* * * * * 

480 CAGCTTCACGATGATGAAGAAA . TCAATACCGAACTTTTTCTTGTTTTCT 528 

i : I (1:1 I I I MI 1 : M 1(11 ill M 
247 TANCGTCNCCACAAACCACAAACTANCTAATGAGCATGGACCTGAATTTC 296 

529 TCTGATTGTCGTCTTGGCTTGGCTT AATTGGTGTGTGTGTGTGTGTTTGC 578 

1 I I I I I M 1 M II I I I M 1 I I II lit! 
297 TTCTCTTCTTGGCTTGGCTTGACTAAATTGGT TGTGC 333 

579 AGGGCAAGGTGr^c^TrATGTrrArr;^nAncTTGnArrAnr TGCACGTr: 628 

I I I t (I I I M f I M I I M M : : M f I Ml 11 1 M It MMM I 
334 ACGGCAAGGTGGCGCTGATGTCNNCGGGAANCATGCACCAACTGCACATA 383 

629 TTCATCTTCGTGCTCC^GGTnTTrrAT^TCArCTACA^^Tr ATCArCAT 678 

I J I ! I 1 I I I I I II II 11 M I I t M 1 Ml M I 1! M f I M f I I If 
384 TTCATCTTCGTGCTCGCCGTCTTCCACGTCTTGTACAGCGTCGTCACCAT 433 

• m • - . 

679 AGnTcrrA AGCCCTrTrAAA ^TnAnrrTTTnrTTPTTrTTrTTrTTrTTTT 728 

I I M I M I 11 M M M i M II I ! II 
434 GACCCTAAGCCGTCTCAAAGTGAGCATCAT ACTC 467 

* • * • • 

729 ACCGCACGTCTGTCTGTCAGGCGT ACCT ACCTGTTCATC AGGCTTGAGTA 778 

M I ( M I M III III I I i 

468 GAGCTGTTTGTCAATAATCCTT . . . GGTTTCCAATCCAATTCCA 508 

• • • • m 

779 AAACTGTTCCATAATCTGCTCCGGCATAATCCTCTCCTCCTGCAG ATGAG 828 

H III 1 I I II M M I I t I ! M M It II 11 

509 AAGCTGGCACTGATCCTGCTCCGG CTTCCTGCAGATGAA S47 

829 AACATGGAAGAAATGGGAGACAGAGArCACCTCCTTGGAATACCAGTTrG 878 

I I I I f I I M I M M I I Mill! II II (Mi M I I M I M 
548 GCAATGG AAG AAGTGGG AGTCGG AG ACCGCCTCGCTGG AGT ATC AGTTCG 597 

879 CAAA1GGTCAGGATCCCCCACTCTGCAATCTCCCCTTCTTCGAAACCAAA 928 

I I I I II I I II II I I II I 1 Ml 

598 CGAATGGTCAG CTTCAACTTTTCTTACTGAAA 62 9 

* * * * 9 

929 CCTGATGATCCATTT . . . AAAGACGCAGGCACGATCA GAGTGAGT 970 

MMM Mill I I I If M M It I M M ( I M 

630 CCGGATG . . . CATTTACAACAAACGCACGCACG ATC AATC ATCAC AGTGT 67 6 

971 GAACTGAT . GT ATGTTC ATTTTT TGTGTCCT . TTCAG ATCP THPfl^fyj 1016 

II I 111 I I I M II 11 II I I I I I I Ml 

67 7 G AGCCG AT ACGTTG AACCCG ATTG AAATCCTCCGC AG ATCCC ATCGCCGG 72 6 
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1017 TTCCraTTrArrc-ArrAGAcnrcGTT . cotka Ar^c^CAre^rrare-rrrr 1065 
1 I II I 1 I I I I I I I I M I M f (It! MM f|f | | |f II I ! M I I I 
727 TGCCGGTTCACGCACCAGACGACGTTGGGTGAGGCGGCACCTGGGCCTCT 776 

1066 CCAGCArrr!rTf^rATrAnATr^r;TnCTf;AaTTTTTTAf^'rf^r*rn'rr' T g 1115 
I N I I I I I I I Ml M I M M I I I 
777 CCAGCACCCCCGGCGTCAGATGGGT 801 



1166 GCCTGTGATGTTTGTTGCCTTGTCA^T(^frrTTTTrAf^rAnT^r-r Tr 12 i 5 

I I I I M 1 I M [ M I M I I I M I I I I 
802 GGTGGCCTTCTTCAGGCAGTTCTTC 82 6 

1216 hGGTCAGTCKCr.AAttTttkcTAac^^ 126 5 
f Ml II I I I I I M M II I I I M I I 1 I I I M II IIKIMIill 
827 ACGTCGGTGACCAAGGTGGACTACCTGACCTTGCGGCAGGGCTTCATCAA 87 6 

1266 CGTACGTGCCTCCCCTTCT AGCTCCGCCATTGCTGCCGCGATGTAGCAGC 1315 

877 c 877 



1366 CAAATTATTCTGCGCAGGCGCATTTnTC^nAAAAr AGrAA^TTr^Ar^Tp 1415 

1 I I I 1 I I II I 1 J I I M I I 1 I II I I M 
878 GCGCATCTCTCGCAGGGC AACAGGTTCG ACTTC 910 

1416 CACAACyrACATCAAGA^TCGATGGA^ACGACTTrAA^CTr^TrnT^ i 46 5 

I M M II I 1 I M I M I M M 1 I II 1 I II I I II I M I I II 11 I M I I I 
911 CACAAGTACATCAAGAGGTCGTTGGAGGACGACTTCAAAGTCGTCGTCCG 960 

1466 CA1CAGGTACGTTCCATTCCTTCCTCTGCAC !cACACCACAC 1506 

M I M M M M M I I M I I I I M M I I I II ( Ml 

961 CATCAGGTACGCGCCATTCCTTTCTCTGCACAAATTAATACATCCACCAC 1010 

1507 CCCATGGATAGATTTTAACAATTGCTGTCAGGTTCCACATGATAACAATA 1556 

i M i : I M M (I (J: M I M 

1011 CACATANGTAG AT AGATAGA . TCGATANATANATTA 1045 

1557 T ACTATG AACTTGGTCTTTGCTCCTTGTCCTTGCACGATCATGACACATT 1606 

nt%Ae 111 I I I I II I I I I I I I 1 M I I M M I M 

1046 TAG • AAGTGCCGGTACGTACGTACGTCTCAT . . . ATG ATCTTGAC ACATC 1091 

1607 TGGCCTGTTTTCGCAGCCTCCCre^ 1656 

M III II MM Ml Ml Ml M M M I M M I MM 

1092 TGTCCTCTTGCCGCAATCTCAAGCTCTGGTTCGTGGCGGTCCTCATCCTC 1141 

1S57 TTCCTTGACTATP a ATfyyr ATfysAcr*TTr > Tpr t TPTrr^Tri»r>r ATT ^ 170 5 

1 I I I I I M M I II II M If 1 M I Mil 1 I 

1142 TTCCTTGATTTCGACGGTAGCCGCCTTGTCCATGCCCTGCTCGCCCTCTC 1191 

1706 CTTTGCAGCTAAATAAAACACTTGCAATTCGTCTCGTGATCACCGCTCAT 1755 

M It M III II I I I II I M I M 
1192 CTCCGCTTCTCTCC AT AATTTGTG . AACTTGTCCCGT AT 122 9 

1756 TTTTCAACCATTTCTTTTTCTACTCATAG GGG . TTKftP APKCTCATPTCp 1804 

1 IN II II I I II II I M 1 If II I It M I M 

1230 ATAACCACACCACCGTCGTCTTCTCGCAGGGGATCGGCACTCTTCTCTGG 127 9 

1805 ATTTCTTTC ATrrrTPTrrHTf^* AffittPafimTrTrrtT^ft a a^Caa 1854 
I I M I I II I I M II I M M M M I M M I 

12 80 ATGTCCGTGGTTCCTCTCGTGGTAAGTCCA CAATTTGAATAGA 1322 

* * • - . 

1855 CAGCAAACCCAATTTGATCGCAATGGAAACCCACACCTAATATTAACTCA 1904 
Ml M I II I II II II I M I M II 

1323 CAACCTGTCCAATTGTGATGTACAGTACCTCCAAACTTAA TTA 13 65 
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1905 AAATGTCAATTGTCGGTGCGTCTTCC TCAACAG ATHfrTPTTGTGT 1949 

1 ( I i M I I I f || I M I I I II M I I I I I M II I I 

1366 ACATGTC ATTTGCTGAT ♦ . GTCTTGCGTGTAACATTAGATCCTCTTGTGG 1413 

1950 CTTraAAC CAAfOTraAftATftATrATr aT^ 1999 

I 1 1 I i I I II I I I I I M I I I I f I 1 I I I I M II M i I M II I : I I | M 
1414 GTTGGGACCAAGCTGGAGATGGTGATCATGGAGATGGCCCAGGANATCCA 1463 

2000 CttACCCXttGAttGrr.A TC-AACZXlG^ 2049 

I I I I I I I I I I I I M 1 I I I I I I I i I 1 I I I I I f I I I I M 1 I I 1 ! I 
1464 TGACCGGGAGAGCGTCGTCAAGGGTGCTCCCGCCGTCGAGCCCAGCAACA 1513 

2050 ACTTCTTrT<?crrTPrA rc 2059 

in 1 1 1 1 (i 1 1 1 1 1 1 1 1 ii ii ii 1 1 1 ii i ii 1 1 1 nil inn 

1514 AGT ACTTCTGGTTCAACCGGCCTG ACTGGGTCCTCTTCCTCATGC ACCTC 1563 

2100 ACGTTr?TTr:rAr;AArf^ CTTTCA(?AT ^nrr^ATTTTnTc;Tnr?ArAaTr:f;T 2149 
M I || | | | | | I | | I M | | | M | | |[ | 111!! II i 1 I I I I I M I i I 

1564 ACACTCTTCCAG AACGCGTTTCAGATGGCTCATTTCGTGTGGAC AGTGGT 1613 

• * • • 
2150 ACGCCACCGATG AACTTGTC AGTT AACATGGG 2181 

I I : I I I I II I I I I I I I I I : I 
1614 A CNT ACAAGT ACTTGTC ACTTCACTTANGCT AACTCC AAC AAACG AA 1660 



2182 TGTCAAGGCACCGAGTGCCGCTG ATGAACTGCTCTG ACGG AG 2223 

MM Mill! 1 I I 1 I Ml 

1711 GACACAAAACTCAATCCAACGCGCGGTAGCAAACGAACGTTTTTCCGTAC 17 60 

2224 ATTTAC ! TTG 2232 

I I I I Ml 
1761 GTTTTCGTCCGCTTTCGCCCCATCCCAGCCCAAATTCGTTGACGTTGTTG 1810 

2233 tgttgtagtct: ACGr.rr.GGr.rTG a ah a a atgtt acc ac acgc ag Avr.GGG 2282 

I I I II M I M II M II I I I! M I M I M M I M I I I I I I 
1811 CATCGCAGGCCACGCCCGGCTTGAAGAAATGCTACCACGAGAAAATGGCA 1860 

2283 CTftAf^AT CATftAAraTCOTC^ 2332 

I M I M I I Mill II I Mill II (I . MM M I Mill 
1861 ATGAGCATCGCCAAGGTCGTGCTGGGGGTAGCCGCCCAGATCTTGTGCAG 1910 

2333 CTATATc;ACCTTcrrrrrrfrrArrc 2382 

: M I! M II I I M M : I M II M I I M I I I 
1911 NT AC ATC ACCTT CCCGCTHT ACGCGCTCGTC AC 1943 



2433 AATCATCTGTGTGTGCTGGPTTTGT ATGCAG ATGGG ATC A A ACATG A AG A 2482 

II II M 1 1 I III lllllllll 
1944 GCAGATGGGCTCACACATGAAGA 1966 

2483 GGTCrATrTTPGArGAGC AGACGTrCIAAGGC: . GCTr.Arr.AACTGGrGGA A 2531 

I II : I I I II I M 11 M I II I II I I II I I M II M I I M M I 1 I 
1967 GAAGCANCTTCGACGAGCAGACGGCCAAGGCGGCTGACCAACTGGCGAAA 2016 



2532 rACGGrrA AGGAGAAGAA^AAAGTr rnAGACACGGACATGCTGAT^rTr 2581 

f M II I I II M I I I M M I 1 I II I 11 I II! II II I It M M I I 
2017 GATGGCCAAGGAGAAGAAGAAGGCCCGAGACGCGGCCATGCTGATGGCGC 20 66 

2582 AGATGATCGf^GArr^AArArcGAGrrGAGGCTr:GTrGrrGATr;rrr;Ar:r 2631 

Mill Mill III II 11 I I II II II : M II I M 
2067 AGATGGGCGGCGGCGCG ACGCCG AGCGTCGGCTNGTCGCCG 2107 
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2632 r^c^TCATrftrrcp^ 2681 

I I f t 1 f f f 1 I I I 1 M 1 II 1 K INN! II 
2108 GTGCACCTGCTCCACAAGGCCGGGGCGCGGTCCGA 2142 

2682 C-GArttc^Aa *r^r^c;rr^ 2731 

I I I M I 1 I I I I I I I I 1 I 1 I ! f I 1 f I I I M I M I I II 
2143 CGACCCCCAGAGCGTGCCGGCGTCCCCGAGGGCCGAGAAGGAAGGCGGCG 2192 

2732 ACATCT^ccrrCTTc^^ 2781 

1 Ml III MM II II H HI! 

2193 GC GTGCAGCATCCGGCGCGCAAGGTACCTCCTTGT 2227 

2782 CACAraAraA( 3 yirare^ 2831 

Mi I I I 1 I II I I 11 1 I I If M Hit Milt I I II M M M 1 
2228 GACGGGTGGAGGTCGGCCTCGTCGCCGGCGCTCGACGCTCACATCCCCGG 2277 

2832 TOC AftATTTTTrcTTP A^f ' r Ar^ATGAGACAAGTTTCTG 2871 

M M M I M I M I M M I I M M 1 M 1! M I i 

2278 TGCAGATTTTGGCTTCAGCACGCAACGTTGACCGATCAGACAAGTTCCTT 2327 

2872 TATT 2875 
I I 1 

2328 TTTT 2331 
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t I * • 

QG CTGCT CCG C CAG CA AACC AGACAOVCAG CAGCCT ACCTCCCT 
ACCTAGCGTCCGCTTTCT 1 1 I 1 TTTCCTTTCGC C rC T C TT GC lT U CTCCGGCCGGCCACG 
TCGATAGCCGGCCACGGCCACCCACCTCGCGGTTGCGTCGCGTGCATCTGCGTGTGCGTA 
CCTG CTAGAGG CC CCCGTCTGCTTCCTCCGGG CAAGGAAGCaCCTTGCGGCGCTCGACCG 



MS OK XG V PARELPETPS 

ATGTCGGACAAAAAAGGGGTGCCCGCCCGGCAGCTGCCGGAGACGCCGTCCTCGGOGGTG 

I A*'V-V-*-'-V'v' r--'~. -A -AM V L V S V L H\ E H G L H K 
GCGGTG GTCTTCG CCG CC ATGGTGCTCGTGTCCGTCCTCATGGAAC ACCGCCTCCACAAG 

L G H W F Q H RHKKA LVEALEXM 
CTCGGCCAT^TGGTTCCAGCACCGGCACA^GAAGGCCCTGTGGCAGGCCCTGGAGAAGATG 

X A S |L**-. H" L - • V - G F - - I : S . L ; L'-v. t> -.1 - V^T^QUVD^Ed 
AAGGCGGAGCTCATGCTGGTGGGCTTCATATCCCTGCTCCTCATCGTCACGCAGGACCCC 

U gI?*A .'. *V HV C S| EDAADVHV PCXR 

ATCATCG CC AAGAT ATGC ATCTCCG AG G ATGCCGCCC ACGTCATG TGGCCCTG CAAGCG C 

GTEGRXPS KTVOTCPEGXVA 
GGCACCGAGGGCCGCAAGCCCAGCAAGTACGTTGACTACTGCCCGGAGGGCAAGGTGGCG 



l» M S T G S L H Q L H tV - T*: r-? ■■•» V^L'^-A ->V<H»rl 
CTCATGTCCACGGGCAGCTTGCACCAGCTCCACGTCTTCATCTTCGTGCTCGCGGTCTTC 

tHVTTSVI T lAtii S RLXHRTVK 
CATG7CACCTACAGCGTCATCACCATACCTCTAAGCCGTCTCAAAATGAGAACATGGAAG 

KWETETTS UETQFANDPARF 
AAAT G GG AG AC AG AG AC C AC CTC CTTG G AATACCAGTTCGCAAATG ATCCTG C ACG3 TTC 

RFTHQTS FVXRHLGLSSTPG 
CGGTTCACGCACCAGAC^TCCTTCGTGAAGCGCCACCTGGGCrTCTCCAGCACCCCTGGC 

I RWVVAF F RQFFRSVTXVDT 
ATCAGATGGGTGGTGGCCTTCT7CAGGCAGTTC7TCAGGTCAG7CACCAAGGTCGACTAC 

LTLRAG F I NAHLSQNSXFDF 
CTCACCTTG AGGG CAGGCTTCATCAACGCG CATTTG TCGCAAAACAGCAAGTTCG ACTTC 

HKYI X R S M EOOFX \ V V V . G ' - I-^S^T} 
CACAAGTACATCAAGAGGTCGATGGAGGACGACTTCAAGGTCGTCGTCCGCATCAGCCTC 

IPLV G VA I LTI.FI,] 0 I N G V G j'WLl 
CCGCTG TGGCG TG TGCCG ATCCTCACCCTCTTCCTTG ACATCAATCCGCTTCG CACGCTC 

11 V- I 'S F I P L V I L-L C V~g1 T X t, E M 
ATCTGGATTTCTTTCATCCCTC7CCTGATCCTCTTGTGTGTTGGAACCAAGCTGGAGATG 

I IMEMA L E IQ DRASVIXGAP 
ATCATCATGGAGATGGCCCTGGAGATCCAGGACCGGGCGAGCCTCATCAAGGGGGCCCCC 

V V E P S' N KFFWFHRPDWVLFF 
GTGG TCG AG CCCAGCAAC AAGTTCTTCTG GTTCCACCGCCCCGACTGGGTCCrCTTCTTC 

IH LTLFQNAFQMAHFVVTVA 
ATACACCTG ACGTTGTTC CAG AACG CGTTTCAGATGGCGCATTTTGTG TGGACAGTGGCC 



TPGLXKCY HTOIGLSIMX { V;^V| 
ArGCCCGGCTTGAAGAAATGCTACCACACGCAGATCGGGCTGAGCATCATGAAGGTGGTG 

G - A a Q F LCST'HTF P L vT ■ A^ vJ*v] 
GTGGGGCTAGCTCTCCAGTTCCTCTGCAGCTATATGACCT7CCCCCTCTACGCGCTCGTC 

(fjQMGSNM X R9 I rOEQTS XAL 
ACACAGATG GG ATCAAAC ATG AAGAGG TCC ATCTTCG ACGAGCAG ACGTCCAAGGCGCTC 



TNWRMTAKE j X X X V R| D T D M L M 
AC CAACT G G CG GAACACG G C CA AGG AG AAG AAGAAAG TCCGAG AC ACG G A CATGCTGATG 

AQMI GOATPS RGSSPMPSRG 
GC7CAGATGATCGGCG ACGCAACACCGAGCCGAGGCTCGTCGCCGATGCCGAGCCGGGGC 

SSPVQL LH XGMGRSDDPQSA 
TCATCACCCCTGCACCTGCTTCACAAGGGCATGGGGCGGTCGGACGACCCCCAGAGCGCG 

PTSPRTQQ EARQMYPVVVAH 
CCCACCTCGCCAAGGACCCAGCAGGAGGCTAGGGACATGTACCCCCTTGTGGTGGCCCAC 

PVHRLNPNORRRSASSSALE 
CCGCTGCACAGACTAAATCCTAACGACACGACCAGCTCCCCCTCGTCGTCGGCCCTCCAA 

AOtPSADFSrCQG' 
GCCGACATCCCCAGTCCACATTTTTCCTTCAGCCAGCGATCAGACAAGTTTCTGTATTCA 



TG TT AC T CC C A AT GT A T A GC C A A C A T A GG A TH TG ATG AT TCGT AC A_ATAA£A AAT AC AA T 
TTTTTACTG AGTC 
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1 GAATTCAATT AAGGACAAC A ACGGATGATA GGCTTAAGCT AGAGAGGATT 

51 CATATCGATT AATTAACTGT ACTTAAGTTG AGGTAAAACT CTATCGATTG 

101 CTTTGGACAC CGCCTCTCCC ATGATCTGCC AAGTTGAGCC GGCCTACCTA 

1S1 ATTTTCTTCG AAAGCACACA ACAAACGAAG GTAACCACTA ATCTAGACAC 

201 CACGCCTAAG TTATCAATTA CTACTCTAGT CTCGCGTAGA AACTTCATTC 

251 TTTATGGAGA GTGCTAGTAC TAGAGTACTT AATATAATAG TAAGCGACAA 

301 ACCCACGACG ATGAGAATGT ACCTCACTTA CGTAGTCAAT TAAGTCGAAA 

351 AGGAAATCTT GAACACTTAC TTTATTAAAG AAGTATTCCC CGAGGTACAG 

401 GAGAGGAGAG CACGCCAATA ACTCCAGCAC TCCTCCGAAA CCTTTCTCAC 

4S1 TCTCTACCCT TTTTCTCCAC ACAACTAAAA TGATGTCTAA TGTATGAAAG 

SOI TGAGTTGTAC TCTATTTTGT TGTGTGTTTG GAAGTGAAAT TAGCTCATCC 

5 51 TTTTATAGCA ACTTAATGCT CGGTTGTAGG TTGGTAATTA AGTCGGTAAA 

601 CACTCACAAC CACCATCGTC AAC CAATAGG AG ATCGC C AC ATG ATCG AAA. 

651 GCTGACAGTT AGGGGTG CCA ACCCTGTTTT GTCCGAACCA ACCAAACAAC 

7 01 CTCTATCTAG GACCTCTCTT CTATCTCTGA CAAGTCGGCC CATATGGCGG 

7 51 TGCACTATGG ATTAAGTCAA TTTCAGTCGT TTTGGACTGT CATGTGGGCC 

801 CTTCCAATCC TTGTGCTC CC ATATGATTGG TCGAAAGTAC ATTTAATTCC 

851 TGGGTGAGTG CT AG AAC T AA TATGATAGAT GTGCTCCGGC TCCTGGGAAA 

901 GAGGCCACTT GACATACTTG GGGTACTGCC CCAAGGGTAT TCCCTATCGC 

951 TTTTTCATAA TTTTCTCTCT CCAAAATC GG ACGGAAACAA TAAAAAAGAG 

1001 AGGCGATGTT CATCGGCAAA TATCTATTTT TTTGATAGTG TCTTCCCTTA 

10 51 AAACTTGATT TTTGCGAAGA CTTCCGGCTA AAAC C ATG AA ATCAGAGTTC 

1101 CTTGTAACAA ATTTAATTTG C CTAAA? AC A AAAAAGATCG AATGGAGATA 

1151 GCATTAAACT TGCTCCATAC GAATCATATT AGTTGGACCG TAACTCATAG 

1201 AAAAAGTTG C AAGTTGGTTG ACCTATCAAC CCTCTTATGT TGACCGTAAA 

12 Si cctc;ttatgc ATTAAGGATT AAGTACCGGC AGATCGTCAC TACTCACGAA 

1301 TGCACAAATT TCCGGTAACG TAGGATGGGA TGAGTTGGTC ACAAACGGGT 

13 51 CACCACGTCG CCCAACCTGC CGCGATCGAG CCATTGGCCG GCGATGCACG 
1401 CGCTTTGACA CAGCCGCCCG CCGCCCCCCG GCCCGCCCCC GTTTTTAATA 

14 Si AAAACCGGCC GCCCCCTGTC AAAGGTCTC A AAGTGTCAAG TGCATCAGAG 
1501 CTAACCTAGC GGTCACCCAG TCAGCTCACC CCGAGACGCA CCAGGGGATC 
IS 51 TATCGGATCA TGGCAGGTGG GAGATCGGGA TCGCGGGAGT TGCCGGAGAC 
1601 GCCGACGTGG GCGGTGGCCG TCGTCTGCGC CGTCCTCGTG CTCCTCTCCG 
1651 CCGCCATGGA GCACGGCCTC CACAACCTCA GC CATGTACG CGCGCGCGCA 
17 01 CGCGGTGTGC TCATCTCTCG AGTTAATTTG GTTGTTGTTG TTGTTGTGTT 
17 51 CTTGTGACAT CTC AATTAAC ATCCGATCGT GGTCGATCGA TCGCCCTGTG 
1801 GTGGC G AT AC TCCTTGCATT GCAGTGCTTC CGTAGGCGGC AGAAGAAGGC 
15 5 1 CATCX5GCGAC GCCCTCGACA AG ATC AAAGC AGGTCACCCT CAGCCTCAGC 
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TTTCCAATCC AATTCCAAAG CTGGCACTGA 
GATGAAGCAA TGGAAGAAGT GGGAGTCGGA 
AGTTCGCGAA TGGTCAGCTT CAACTTTTCT 
ACAACAAACG CACGCACGAT CAATCATCAC 
ACCGATTGAA TCCTCGCAGA TCCATCGCGG 
GACGTTGGTG AGGCGGCACC TGGGCCTCTC 
GGGTGGTGGC CTTCTTCAGG CAGTTCTTCA 
TACCTGACCT TGCGGCAGGG CTTCATCAAC 
CAGGTTCGAC TTCCACAAGT ACATCAAGAG 
AAGTCGTCGT CCGCATCAGG TACGCGCCAT 
ATACATCCAC CACCACATAG GTAGATAGAT 
CAAGTGCCGG TACGTACGTA CGTCTCATAT 
CTTGCCGCAG TCTCAAGCTC TGGTTCGTGG 
GATTTCGACG GTAGCCGCCT TGTCCATGCC 
TTCTCTCCAT AATTTGTGAA CTTGTCCCGT 
TCTTCTCGCA GGGATCGGCA CTCTTCTCTG 
TGGTAAGTCC ACAATTTGAA TAGACAACCT 
ACCTCCAAAC TTAATTAACA TGTCATTTGC 
TAGATCCTCT TGTGGGTTGG GACCAAGCTG 
GGCCCAGGAG ATCCATGACC GGGAGAGCGT 
TCGAGCCCAG CAACAAGTAC TTCTGGTTCA 
TTCCTCATGC ACCTCACACT CTTCCAGAAC 
CGTGTGGACA GTGGTACGTA CAAGTACTTG 
CCAACAAACG ACCCCAAATT AATGGTCCGT 
TTGGGGTAAA CGGACACAAA ACTCAATCCA 
GTTTTTCCGT ACGTTTTCGT CCGCTTTCGC 
TTGACGTTGT TGCATCGCAG GCCACGCCCG 
GAGAAAATGG CAATGAGCAT CGCCAAGGTC 
GATCTTGTGC AGCTACATCA CCTTCCCGCT 
TGGGCTCACA CATGAAGAGA AGCATCTTCG 
CTGACCAACT GGCGAAAGAT GGCCAAGGAG 
GGCCATGCTG ATGGCGCAGA TGGGCGGCGG 
CGTCGCCGGT GCACCTGCTC CACAAGGCCG 
CAGAGCGTGC CGGCGTCCCC GAGGGCCGAG 
GCATCCGGCG CGCAAGGTAC CTCCTTGTGA 
CGCCGGCGCT CGACGCTCAC ATCCCCGGTG 
CAACGTTGAC CGATCAGACA AGTTCCTTTT 
TATCATTTCA TTGATAGACA GTAGAAATTA 
CTATGTACAC AAGGGCACAG CAAAGGATCA 
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Figure 1 O 

1 ATGGCAGGTG GGAGATCGGG ATCGCGGGAG TTGCCGGAGA CGCCGACGTG 
SI GGCGGTGGCC GTCGTCTGCG CCGTCCTCGT GCTCGTCTCC GCCGCCATGG 
101 AGCACGGCCT CCACAACCTC AGCCATAAAA CCACCGCAGA AGTTCTCATA 
151 TTTCTTGTCC TATCTCCACT TGCAGAGCTG ATGCTGCTGG GCTTCATATC 
201 CCTGCTTCTC ACCGTGGCAC AGGCGCCCAT CTCCAAGATC TGCATCCCCA 
251 AGTCGGCTGC CAACATCTTG TTGCCGTGCA AGGCAGGC CA AGATGCCATC 
301 GAAGAAGAAG CAGCAAGTGG TCGCCGGTCC TTGGCCGGCG CCGGCGGCGG 
351 GGACTACTGC TCGAAATTCG ATGGCAAGGT GGCGCTGATG TCGGCAAAGA 
401 GC ATGCAC C A GCTGCACATT TTCATCTTCG TGCTCGCCGT GTTCCATGTT 
451 ACCTACTGCA TCATCACCAT GGGTTTAGGG CGCCTCAAAA TGAAGAAATG 
501 G AA GAAGTGG GAGTCACAGA CCAACTCATT GGAGTATCAG TTCGCAATCG 
S51 ATCCTTCACG ATTCAGGTTC AC GCATCAG A CGTCGTTCGT GAAGCGGCAT 
601 CTGGGATCAT TCTCAAG C AC CCCTGGGCr-C AGATGGATCG TAGCATTCTT 
651 CAGGCAGTTC TTTGGGTCCG TCACCAAGGT GG AC T AC C TG ACCATGCGGC 
701 AAGGCTTCAT C AATGC GC AT TTGTCGCAGA ATAGCAAGTT CGACTTCCAC 
751 AAATACATCA AGAGGTCTTT GGAGGACGAC TTCAAAGTTG TCGTTGGCAT 
801 CAGCCTCCCT CTGTGGTTCG TCGGAATCCT * TGTACTCTTC CTCGATATCC 
S51 ACGGTCTTGG CACACTTATT TGGATCTCTT TTGTTCCTCT CATCATCGTC 
901 TTGTTAGTTG GGACCAAGCT AC AG ATGGTG ATCATGGAGA TGGCCCAAGA 
951 GAT AC AGG AC AGGGCCACTG TGATCCAGGG AGCACCTATG GTTGAACCAA 
1001 GCAACAAGTA CTTCTGGTTC AACCGCCCTG ACTGGGTCTT GTTTTTCATA 
1051 CACCTGACAC TCTTCCATAA CGCATTTCAG ATGGCGCATT TCGTATGGAC 
1101 TATGGCAACA CCTGGTCTGA AGAAATGCTT CCATGAAAAT ATTTGGCTG A 
1151 GCATCGTGGA AGTCATTGTG GGGATCTCTC TTCAGGTGCT ATGCAGCTAC 
1201 ATCACCTTCC CGCTCTACGC GCTCGTC AC A C AGATGGG A T CGAACATGAA 
1251 GAAGACAATT TTCGAGGAGC AAACGATGAA GGCGCTGATG AACTGGAGGA 
1301 AGAAGGCGAT GGAGAAGAAG AAGGTCCGGG ACGCCGACGC GTTCCTGGCG 
13 51 CAGATGAGCG TCGACTTCGC GACGCCGGCG TCGAGCCGCT CCGCGTCGCC 
1101 GGTGC AC CTG CTGCAGGTCA CAGGGCGGGT CGCACGCCCG CCGAGCCCAA 
1151 TCACGGTGGC CTCACCACCG GCACCGGAGG AGGACATGTA CCCGGTGCCG 
1501 GCGGCGGCTG CGTCTCGCCA GCTGCTAGAC GAC CCGCCGG . ACAGGAGGTG 
15 SI GATGGCATCC TCGTCGGCCG ACATCGCCGA TTCTGATTTT TCCTTCAGCG 
1601 CACAACGGTG A 
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TCGGGAGCTG TCGGACACGC CGACGTGGGC 
TCATGATACT CGTCTCCGTC GCCATGGAGC 
CACTGGTTCC ACAACTGGCG CAAGAAGGCC 
GATCAAGGCG GAGCTCATGC TGGTGGGCTT 
TCACGCAGGA TCCCGTCTCC AGGATCTGCA 
AAGATGCTCC CGTGCAAGCC TTACGACGGC 
GGACAATCAC CGGAGGCTTC TCTGGCTCCA 
GCCGGTTCCT GGCTGCCCCG GCCGGAGTGG 
AAGGTGGCGC TGATGTCAGC GCCAAGCATG 
CTTCGTGCTC GCCGTCTTCC ACGTCTTGTA 
TAAGCCCTCT CAA£_VTGAAG CAATGGAAGA 
TCGCTGGAGT ATC AGTTCG C GAATG ATCC A 
CCAGACGACC- TTGGTGAGGC GGCACCTGGG 
TCAGATGGGT GGTGGCCTTC TTCAGGCAGT 
GTCG ACT AC C TGACCTTGCG GCAGGGCTTC 
CGGCAACAGG TTCGACTTCC AC AAG TAC AT 
ACTTCAAAGT CGTCGTCCGC ATCAGTCTCA 
CTCATCCTCT TCCTTGATTT CGACGGGATC 
CGTGGTTCCT CTCGTGATCC TCTTGTGGGT 
TGATCATGGA GATGGCCCAG GAGATCCATG 
GGTGCTCCCG CCGTCGAGCC CAGCAACAAG 
TGACTGGCTC CTCTTCCTCA TGCACCTCAC 
AGATGGCTCA TTTCGTGTGG AC AGTGGC C A 
TACCACGAGA AAATGGCAAT GAGCATCGCC 
CGCCCAGATC TTGTGCAGCT AC ATC ACCTT 
CGCAGATGGG CTCACACATC AAGAGAAGCA 
AAGGCGCTGA CCAACTGGCG AAAGATGGCC 
^GACGCGGCC ATGCTGATGG CGCAGATGGG 
TCGGCTCGTC GCCGGTGCAC CTGCTCCACA 
GACCCCCAGA GCGTCCCGGC GTCCCCGAGG 
CG T G C AG C AT CCCGCGCGCA AGGTACCTCC 
CCTCGTCGCC GGCGCTCGAC GCTCACATCC 
AGC AC GC AAC GTTCA 
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1 GTTGGTACAT AAAAGACTCT TCCTTTGTCT GTTTTTTGTT CCCAGATTCA 

51 TCTTTACTTA TTGACTAAAT TCTCTCTGGT GTGAGAAGTA AAATGGGTCA 

101 CGGAGGAGAA GGGATGTCGC TTGAATTCAC TCCGACGTGG GTCGTCGCCG 

151 GAGTTTGTAC GGTCATCGTC GCGATTTCAC TGGCGGTGGA GCGTTTGCTT 

201 CACTATTTCG GTACTGTTCT TAAGAAGAAG AAGCAAAAAC CCCTTTACGA 

251 AGCCCTTCAA AAGGTTAAAG AAGAGCTGAT GTTGTTAGGG TTTATATCGC 

301 TGTTACTGAC GGTATTCCAA GGGCTCATTT CCAAATTCTG TGTGAAAGAA 

351 AATGTGCTTA TGCATATGCT TCCATGTTCT CTCGATTCAA GACGAGAAGC 

401 TGGGGCAAGT GAACATAAAA ACGTTACAGC AAAAGAACAT TTTCAGACTT 

451 TTTTACCTAT TGTTGGAACC ACTAGGCGTC TACTTGCTGA ACATGCTGCT 

501 GTGCAAGTTG GTTACTGTAG CGAAAAGGGT AAAGTACCAT TGCTTTCGCT 

551 TGAGGCATTG CACCATCTAC ATATTTTCAT CTTCGTCCTC GCCATATCCC 

601 ATGTGACATT CTGTGTCCTT ACCGTGATTT TTGGAAGCAC AAGGATTCAC 

I 651 CAATGGAAGA AATGGGAGGA TTCGATCGCA GATGAGAAGT TTGACCCCGA 

' 701 AACAGCTCTC AGGAAAAGAA GGGTCACTCA TGTACACAAC CATGCTTTTA 

j 751 TTAAAGAGGA TTTTCTTGGT ATTGGCAAAG ATTCAGTCAT CCTCGGATGG 

j 801 ACGCAATCCT TTCTCAAGCA ATTCTATGAT TCTGTGACGA AATCAGATTA 

851 CGTGACTTTA CGTCTTGGTT TCATTATGAC ACATTGTAAG GGAAACCCCA 

-. 901 AGCTTAATTT CCACAAGTAT ATGATGCGCG CTCTAGAGGA TGATTTCAAA 

J 951 CAAGTTGTTG GTATTAGTTG GTATCTTTGG ATCTTTGTCG TCATCTTTTT 

1001 GCTGCTAAAT GTTAACGGAT GGCACACATA TTTCTGGATA GCATTTATTC 

'i 1051 CCTTTGCTTT GCTTCTTGCT GTGGGAACAA AGTTGGAGCA TGTGATTGCA 

1101 CAGTTAGCTC ATGAAGTTGC AGAGAAACAT GTAGCCATTG AAGGAGACTT 

1151 AGTGGTGAAA CCCTCAGATG AGCATTTCTG GTTCAGCAAA CCTCAAATTG 

1201 TTCTCTACTT GATCCATTTT ATCCTCTTCC AGAATGCTTT TGAGATTGCG 

1251 TTTTTCTTTT GGATTTGGGT TACATACGGC TTCGACTCGT GCATTATGGG 

1301 ACAGGTGAGA TACATTGTTC CAAGATTGGT TATCGGGGTC TTCATTCAAG 

1351 TGCTTTGCAG TTACAGTACA CTGCCTCTTT ACGCCATCGT CTCACAGATG 

1401 GGAAGTAGCT TCAAGAAAGC TATATTCGAG GAGAATGTGC AGGTTGGTCT 

1451 TGTTNGTTGG GCACAGAAAG TGAAACAAAA GAGAGACCTA AAAGCTGCAG 

1501 CTAGTAATGG AGACGAAGGA AGCTCTCAGG CTGGTCCTGG TCCTGATTCT 

1551 GGTTCTGGTT CTGCTCCTGC TGCTGGTCCT GGTGCAGGTT TTGCAGGAAT 

1601 TCAGCTCAGC AGAGTAACAA GAAACAACGC AGGGGACACA AACAATGAGA 

1651 TTACACCTGA TCATAACAAC TGAGCAGAGA TATTATCTTT TCCATTTAGA 

1701 GGATCATCAT CAGATTTTAG CTTCAAGGTC CGGTTTTGTG GTTTATACAT 

1751 AAGTTATAGT GACTTGATTT TTTTGTTTTG TTACAAAGTT ACCATCTTTG 

1801 GATTAGAATT GGGAAATTGA ATCTGTTTGT ATATTGTATT ATTTGGAACA 

1851 TTGTGGATGC CCATGGATAT GTTTCTGTTC 
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1 MAGGRSGSRE LPETPTWAVA WCAVLVLVS AAMEHGLHNL SHKTTAEVLI 
51 FLVLSALAEL MLLGFISLLL TVAQAPISKI CIPKSAANIL LPCKAGQDAI 

101 EEEAASGRRS LAGAGGGDYC . SKFDGKVALM SAKSMHQLHI FIFVTiAVFHV 

1S1 TYCIITMGLG RLKMKKWKKW ESQTNSLEYQ FAIDPSRFRF THQTS FVKRK 

201 LGSFSSTPGL RWIVAFFRQF FGSVTKVDYL TMRQGFINAH LSQNSKFDFE 

251 KYI KRS LEDD FKWVGISLP LWFVGILVLF LDIHGLGTLI WISFVPLIIV 

301 LLVGTKLEMV IMEMAQEIQD RATVI QGAPM VEPSNKYFWF NRPDWVLFFI 

351 HLTLFHNAFQ MAHFVWTMAT PGLKKCFHEN IWLSIVEVIV GISLQVLCSY 

4 01 ITFPLYAIiVT QMGSNMKKTI FEEQTMKALM NWRKKAMEKK KVRD AD AF LA 

451 QMS VD FAT PA SSRSASPVKL LQVTGRVGRP PSPITVASPP APEEDMYPVP 

501 AAAASRQIiIiD DPPDRRWMAS SSADIADSDF SFSAQR* 
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Figure 1 5 
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